Application of Over-complete Blind so Automatic Speech Re
نویسنده
چکیده
Spoken dialogue based information retrieval systems that are used in mobile environments are becoming popular. However, mobile environment is dynamically changing and there exists many interfering signals. These two effects result in degradation in automatic speech recognition (ASR) accuracy and hence, degradation in performance of spoken dialogue based information retrieval systems. One way to improve the speech recognition accuracy is to separate the intended speech signal from the interference signals and use the enhanced speech signals in recognition. In this paper, we describe a technique that we applied for speech signal enhancement. We also provide the relative improvement in recognition accuracy that we obtained by using such enhanced speech signals in an ASR system. For speech signal enhancement, we apply the Over-complete Blind Source Separation (OCBSS) technique that we developed. For ASR, a continuous speech recognizer was used. In this paper, we also compare the recognition accuracy results of another BSS technique that is based on Independent Component Analysis (ICA) – JADE-ICA with OCBSS. The results indicate that as the complexity of signal separation problem increases i.e., close to real scenarios, the OCBSS provides about 30% better relative improvement in recognition accuracy as compared to JADE-ICA.
منابع مشابه
Fuzzy Clustering Approach Using Data Fusion Theory and its Application To Automatic Isolated Word Recognition
In this paper, utilization of clustering algorithms for data fusion in decision level is proposed. The results of automatic isolated word recognition, which are derived from speech spectrograph and Linear Predictive Coding (LPC) analysis, are combined with each other by using fuzzy clustering algorithms, especially fuzzy k-means and fuzzy vector quantization. Experimental results show that the...
متن کاملSImplementation of Frequency Domain Approach Using Instantaneous Mixing Auto Recursive for Separation of Speech Signals
In the present work a novel algorithmic rule by taking the speech from two different microphones and separate these speeches by prediction of separating speech mixtures that is predicated on separation matrices is planned. In multitalker applications so as to boost individual speech sources from their mixtures is done by Blind source Separation (BSS) ways. From the previous published works of s...
متن کاملPerceptual evaluation of blind source separation for robust speech recognition
In a previous article, an evaluation of several objective quality measures as predictors of recognition rate after application of a blind source separation algorithm was reported. In this work, the experiments were repeated using some new measures, based on the perceptual evaluation of speech quality (PESQ), which is part of the ITU P862 standard for evaluation of communication systems. The raw...
متن کاملA Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation
Abstract Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...
متن کاملContinuous time-frequency masking method for blind speech separation with adaptive choice of threshold parameter using ICA
We propose a novel method for blind speech separation using continuous time-frequency masking. The method is equipped with an adaptive choice of a threshold parameter that is based on utilization of ICA methods. We present a direct application that consists in the speech segregation for automatic transcription of spoken broadcasts disturbed by background music. Experimental results show improve...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002